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ABSTRACT 

We study the class of quasi-alphabetic relations, i.e., tree transformations defined 
by tree bimorphisms (if,L,ip) with tp,^ quasi-alphabetic tree homomorphisms and L a 
regular tree language. We present a canonical representation of these relations; as an 
immediate consequence, we get the closure under union. Also, we show that they are 
not closed under intersection and complement, and do not preserve most common oper- 
ations on trees (branches, subtrees, t)-product, f-quotient, /-top-catenation). Moreover, 
we prove that the translations defined by quasi-alphabetic tree bimorphism are exactly 
products of context-free string languages. We conclude by presenting the connections 
between quasi-alphabetic relations, alphabetic relations and classes of tree transforma- 
tions defined by several types of top-down tree transducers. Furthermore, we get that 
quasi-alphabetic relations preserve the recognizable and algebraic tree languages. 

Keywords: regular tree language, tree homomorphism, tree bimorphism, tree transducer 
1. Introduction 

Tree transformations were extensively study in the past four decades from the 
algebraic point of view offered by tree bimorphisms [3, 6, 22, 24, 25] or from the 
dynamic point of view provided by tree transducers [4, 9, 11, 17, 18]. Recently, new 
types of tree transducers were used with considerable success in modeling trans- 
lations between natural languages especially because of their ability to capture 
syntax-sensitive transformations and to do certain reorderings of parts of sentences. 
This way, the new field of syntax-based machine translation was established (see 
[12, 14, 15, 16] and the references therein). Unfortunately, properties that may im- 
prove the translation process (e.g., closure under composition and preservation of 
recognizable and algebraic tree languages [14, 15]) do not hold in general for most 
of the main tree transducer types [5, 9, 11, 16]. 

First proposed as models of a compiler [13], synchronous grammars represent 
classes of tree transformations that describe in a natural way translations between 
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natural languages [1, 19, 20, 21]. It consists of two formal grammars with produc- 
tions linked by some criteria, pairs of recursively related sentences being generated 
simultaneously. This way, not only that they model the syntax-sensitive transforma- 
tions between natural languages, but moreover, they intrinsically describe (perform) 
difficult local rotations required by natural language pairs with extremely different 
structures such as Arabic-English or Chinese-English. Unfortunately, the mathe- 
matical framework offered by such formalisms is quite poor since for example, no 
closure results were known [20]. 

An elegant algebraic way to define tree transformations is by the tree bimorphism 
formalism which is formed by two tree homomorphisms defined on the same common 
tree language. Tree bimorphisms were used with considerable success in proving 
properties Hke closure under composition and preservation of recognizability, espe- 
cially when suitable restrictions were imposed on its constituents [3, 6, 22, 24, 25]. 
Moroovor, by taking tho yiolds of the input trees and output trees, they arc trans- 
formed into word-for-word translation devices. A survey on the main classes of tree 
bimorphisms and their characteristics is [29]. 

Using the tree bimorphism formalism, S.M. Shieber was the first one who 
linked tree transducers and synchronous grammars in an attempt to improve the 
mathematical framework of the later devices [20, p. 95]: "...the bimorphism char- 
acterization of tree transducers has led to a series of composition closure results. 
Similar techniques may now be applicable to synchronous formalisms, where no 
composition results are known..." Following this lead, the class of quasi-alphabetic 
tree bimorphisms that define the same translations as syntax-directed translation 
schemata of [1] was introduced in [23]. In [28] connections between these tree bi- 
morphisms and other synchronous grammars are presented in detail, and similar 
results involving other types of tree bimorphisms are summed up in [29] . 

It was already shown in [23] that the tree transformations defined by quasi- 
alphabetic tree bimorphisms, called quasi- alphabetic relations here, are closed under 
composition and inverses, and preserve the recognizability of tree languages. In the 
present work we further investigate the properties of this class from a theoretical 
point of view (What other closure properties has or what common operations on 
trees are preserved? Is there any canonical representation of such a class? What 
is its place in the tree transducer hierarchy?) but also having in mind their con- 
nection with synchronous grammars (What other mathematical properties can be 
transferred to several types of synchronous grammars? What is the power of the 
translations defined by quasi-alphabetic bimorphisms?). 

Our results can be summarized as follows. After presenting in Section 2 the 
basic definitions and notions used, we show in Section 3 that there is a canonical 
representation of quasi-alphabetic relations, and by using it, that these relations are 
closed under union. Also, we show that they are not closed under intersection and 
complement. After this, we turn our attention to what common operation on trees 
are preserved by such relations: we found out that intersection and reunion with 
a regular language are preserved, but branches, subtrees, w-product, ii-quotient, /- 
top-catenation are not, in general. We end Section 3 by proving a more general result 
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of [23]: the translations defined by quasi- alphabetic tree bimorphisms are exactly the 
products of context-free string languages. Section 4 is dedicated to the connection 
of quasi-alphabetic relations with other well-known classes of tree transformations: 
alphabetic transductions [6], finite-state relabelings [9], tree transformations defined 
by several types of top-down tree transducers [9] and top-down tree transducers 
with look-ahead [10]. All the results are depicted in the Hasse diagram of Figure 1. 
Moreover, as an immediate consequence of the fact that quasi-alphabetic relations 
are strictly included in the class of alphabetic ones, we get that our class preserves 
recognizable and algebraic tree languages, too. 

2. Preliminaries 

Let R, S, and T be sets, and consider a relation t C S x T. The fact that 
{s,t) € T can also be expressed by writing s t t. For every s G S,\et st = {t \ s t t}. 
More generally, for every A C S, we let At = UaeA '^^^ inverse of r is the 
relation = {(t.s) \ s t t}. The composition of two relations p C R x S and 
T C S X T is the relation p o t = {{r,t) \ 3s G S: r p s t t}. The identity 
relation ids is {(SjS) j s S S}. If S is understood, then we simply write id. For 
(total) mappings ip: S T we generally identify sip and ip{s) for every s £ S. The 
nonnegative integers are denoted by N. For every k €N, the set {i € N | 1 < i < A;} 
is denoted by [k]. 

For a set V, we denote by V* the set of strings over V, and e denotes the 
empty string. By an alphabet we mean a finite set of symbols. A ranked alphabet 
(I],rk) consists of an alphabet S and a mapping rk: S ^ N. Often we leave the 
mapping rk impHcit. For every fc > 0, let S^, = {/ e S | rk(/) = k}. We may write 
T, = {/i/fci, . . . , fn/kn} to indicate that S consists of the symbols /i, . . . , /„ with 
the respective ranks fci , . . . , A:„ . 

Let S be a ranked alphabet and T a set. Then 

E(r) = {/(*!,..., ifc) I / eSfc,ti,...,ffc GT} . 

For every (leaf) alphabet V, the set Te(V) of all E-trees indexed by V is the smallest 
set T such that V C T and T.{T) C T. Subsets of Ts(F) are called (tree) languages. 
Generally, for all considered trees we assume that the ranked alphabet is disjoint 
with the leaf alphabet. For every tree t G Tj:{V), the set pos(t) C N* of positions 
of t is inductively given by pos(t;) = {e} for every v gV, and 

pos(/(ti, . . . , ifc)) = {e} [J{iw\ie [k],w G pos(tj)} 

for every / G and ti,...,tk G T^iV). The label of t at position w G pos(t) 
is denoted by t{w), the subtree of t at w is denoted by t\w, and the replace- 
ment of that subtree in t by the tree u G ?s(V) is denoted by ^[m]^). For every 
n C E U y let posQ(t) = {w G pos{t) I t{w) G fi} and poSf{t) = pos^^fy{t) for every 
/ G Suy. The set of branches oft is br(i) = pos2^uy(0) the set of subtrees of 
t is sub(t) = {t\m I w G pos(t)}. Finally, \t\f = card(posy^(t)), and the height hg{t) 
is the length of a longest string in pos(t). 
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A tree t € T^{V) is linear (respectively, nondeleting) in F C 1/ if \t\y < 1 
(respectively, \t\y > 1) for every y GY. The Y-yield of a tree t G T's(F) is defined 
inductively by ydy(?y) = y for every y S y, ydy(w) = e for every u G F \ y, and 
ydy(/(ii,- --jifc)) = ydy(ti) • • •ydy(tfe) for every / G Sfe and ti,...,tk G ^^(y). 

We fix a set X = {xi | i > 1} of formal variables (disjoint to all other ranked 
alphabets and leaf alphabets). Let n > 0. We let X„ = {xi | i G [n]} and 

Cg(y) = G Ts(y U X„) I Vz G [n] : |tU, = 1} . 

In particular, the elements of C^{V) are called contexts. For every t G Ts(y UX„), 
let var(i) = {x G X„ | pos^(f) ^ 0}. 

For all t, ii, . . . , i„ G T-^iV U X„), we denote by t\ti, . . . , i„] the result ob- 
tained by replacing, for every i G [n], every occurrence of Xi in t by ti. For all 

C Ts(y UX„), L[Li,...,L„] denotes UteL,fieii,...,f„eL„ • • • ' 
Let n = \t\v. More generally, for every v & V , the result of replacing, for every 
i G [n], the j-th (with respect to the usual lexicographic order on the positions) 
occurrence of v by ti is denoted by t[v <— {t\, . . . ,tn)]. For every t G T^{V U X„), 
/ G El, and c G C^(F), we let f {t) = t and (7° = C, and Z'^+HO = filHt)) and 
C"=+i = C[C*=] for ah k>0. 

For every / G Sfc and C rs(V U X„), the f -top- catenation of 

Li,...,Lfe is f{Li,...,Lk) = UtieLi,...,tfceLfc/(^ii---'^fe)- Moreover for every 
V G 1^, the v-product L i' of two languages L, L' C Tj:{V) is 

L»^L'= U (ti,...,t„)] . 

teL,n=|t|„ 
Vje[n] : UeL' 

Then, the v-quotient of i by L' is = {t G rE(F) | {t} L' D L ^ %}. For a 

more detailed description of those operations on tree languages, we refer the reader 
to [6]. 

A {tree) homomorphism ip: Tj:{V) —^ Ta{Y) can be presented by a mapping 
ipv '■ V — > T^{Y) and mappings y>/j : S/j — > Ta{Y U X^) for every fc > as follows: 

(i) Vip = ipv{v) for every v gV, and 

(ii) /(ii, . . .,tk)'fi = (pk{f)[ti(p, . . .,tkip] for every / G Sfc and ti,...,tke Tj:{V). 

We say that it is normalized if for every / G Sfe there exists n > such that 
ydxi^kif)) = Xi . . . Xn- Moreover, such a homomorphism ip is 

• linear [11, 6, 7] (respectively, complete [7]) if <^/s(/) is linear (respectively, 
nondeleting) in Xk for every f GT,k, 

• symbol-to-symbol [7] if ipv{v) G F for every v £ V and Vfe(/) G A(Xfe) for 
every f eT,k, 

• alphabetic [6, 2] (demarquage lineaire in [2]) if it is linear, ipv{v) G F for every 
V eV, and <y7fc(/) € XkU A{Xk) for every / G S^, and 
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• strictly alphabetic [6] if it is complete, alphabetic and symbol-to-symbol. 

We denote by IH, cH, ssH, aH, and saH the classes of all linear, complete, 
symbol-to-symbol, alphabetic, and strictly alphabetic tree homomorphisms, respec- 
tively. Further subclasses of tree homomorphisms can be obtained by combining 
any of these restrictions. For example, IcH is the class of all linear complete tree 
homomorphisms. 

A (tree) bimorphism is a triple B = {(p,L,ip) where L C Tr{Z) is a tree lan- 
guage, ip: Tr{Z) T^{V), and ip: Tr{Z) —^ Ta{Y) are homomorphisms. The tree 
transformation defined by B is 

tb = ° idz, oip = {{tip, tip) \ t G L} . 

The translation defined by B is 

yd(TB) = {{ydy{tip),ydYm) \ t e L} = {(yd^W, yd^(«)) | {t,u) G tb} . 

For all classes Hi and 7^2 of homomorphisms and every class C of tree languages, 
we denote by B{Hi, C, 'H2) the class of tree transformations tb where B = (ip, L, ip) 
with p ^ TLi, L £ C and %jj G 7^2- 

A top-down tree transducer [18, 26] is a tuple M = (Q, E, A, /, iJ) where 

• Q = Qi is a unary ranked alphabet of states disjoint with S U A, 

• S and A are an input and an output alphabet, respectively, 

• / C Q is a set of final states, and 

• i? is a finite set of rules of the form I ^ r where / S Q{Tj:(X)) is linear in X 
and r G TA(Q(var(0)). 

The top-down tree transducer M = {Q,'E,A,I,R) is linear (respectively, nondelet- 
ing) if r is linear (respectively, nondeleting) in var(Z) for every rule I ^ r € R. The 
one-step derivation relation =>m is defined as follows. For every s,t & Ta{Q{T^)) 
we have s =>m t if and only if there exists a rule Z — > r G i?, a position w G pos(s), 
and Ui,. . . ,Un G Ta where n = rk(s(w)) such that s\m = l[ui, . . . , u„] and t = s[u]w 
with u = r[ui, . . . , u„]. Let be the reflexive and transitive closure of =^m- The 
tree transformation computed by M is 

TM = {{s,t) G Ts X Ta I 3g G J: q{s) t} . 

The class of all tree transformations computable by linear (respectively, linear and 
nondeleting) top-down tree transducers is denoted by 1-TOP (respectively, In-TOP). 

Let M = {Q,T,,A,I,R) be a top-down tree transducer. It is a finite-state 
relabeling [9], if every rule Z ^ r G i? is of the form I = q{f{xi, . . . ,Xk)) and 
r = g{qi{xi), . . . ,qk{xk)) for some q,qi,...,qk G Q, f G Sfe, and g G Afc. If 
additionally, 1(1) = r^e) for every I ^ r G R, then M is a. finite- state tree automaton 
(fta) [9]. We generally write rules of an fta in the form q — > f{qi, ■ ■ ■ ,qk) instead 
of q{f{xi, . . . , Xk)) — > f{qi{xi), ■ ■ ■ , qk{xk))- Note that tm coincides with idi, for 
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some -L C Te, if M is an fta. This L is also denoted by L{M), and additionally, for 
every q the notation L{M)g stands for L{N) where N = (Q, S, A, {g}, i?). A 
language L is recognizable if there exists an fta N such that L{N) = L. The class 
of recognizable tree languages [11, Chapter II] is denoted by Rec. Finally, M is a 
relabeling [9] if it a finite-state relabeling and card((3) = 1. We denote the classes of 
transformations computed by finite-state relabehngs, relabehngs, and fta by QREL, 
REL, and FTA, respectively. 

The top-down tree transducer M can be equipped with a look-ahead facility [10, 
16]. The pair {M,c) where M = (Q,S, A,/, i?) is a top-down tree transducer and 
c: R ^ V{Ty.) is called a top-down tree transducer with look-ahead. The look- 
ahead c is regular (or recognizable) , if c{l r) is recognizable for every I ^ r E R, 
and it is finite, if c{l ^ r) G L^^, . . . ,T^] for a finite tree language L C Ty,{X). 
In the latter case, we often write c{l r) = L. The transducer (M, c) inherits 
the properties 'linear' and 'nondoloting' from M. The semantics of a top-down tree 
transducer (M, c) with look-ahead is defined as for the top-down tree transducer M 
with the additional condition that e c(Z — *■ r) in the definition of =^m- The 
class of transformations computed by linear top-down tree transducers with finite 
(respectively, regular) look-ahead is denoted by 1-TOP^ (respectively, 1-TOP^). 

3. Properties of Quasi- Alphabetic Relations 

Let us start by recalling the main notion of this contribution. A quasi-alphabetic 
homomorphism is linear, complete, and basically symbol-to-symbol, but allows vari- 
ables as successors of an output symbol. The precise definition follows. 
Definition 1 (see [23, Section 3]) A tree homomorphism (p: Ts(y) T^(Y) is 
quasi-alphabetic if 

(i) it is linear and complete, 

(a) (pv{v) € Y for every v gV , and 

(Hi) ^kif ) e A{Y U Xk) for every f e^k- 

By qaH we denote the class of all quasi- alphabetic homomorphisms. A quasi- 
alphabetic bimorphism is a bimorphism {tp, L, i/j) such that ip and V' are quasi- 
alphabetic and L is recognizable. 

The name 'quasi-alphabetic' deserves some discussion. They are called such 
because they are almost 'alphabetic' in the sense of [11] (a relabeling in our termi- 
nology). Note that we here use the notion of 'alphabetic' that is used in [6]. Thus, 
with our terminology in mind, we might have called them 'quasi-relabelings'. How- 
ever, the term 'quasi-alphabetic' is estabHshed [23] and we continue to use it. In 
particular, i3(qaH, Rec, qaH) is the class of all the tree transformations defined by 
quasi-alphabetic bimorphisms; such relations are called quasi- alphabetic relations. 

Every quasi-alphabetic homomorphism maps each input symbol to an output 
symbol possibly with some output leaf variables as direct subtrees. However, the 
variables of X have to occur as direct subtrees of the root output symbol. This 
immediately yields the following proposition. 
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Proposition 1 Let if : T^{V) T^{Y) be a homomorphism and t G Tx;(F). 

• If is quasi- alphabetic, then hg(t) < hg{tLp) < h.g{t) + 1. 

• If if is symbol-to-symbol, then ]ig{ttp) < hg{t). 

• If ip is strictly alphabetic, then hg{tip) = hg{t). 

Now we investigate the fundamental properties of quasi-alphabetic relations. We 
start our investigation with a canonical representation of quasi-alphabetic relations 
in the spirit of [6, Proposition 3.1]. This representation will allow us to conclude 
that quasi-alphabetic relations are closed under union. 

For the rest of this section, let B = {(p,L,'tp) with (p: Ty{Z) — > T^iV) and 
Tp: Tr{Z) — !■ Ta{Y) be a quasi-alphabetic bimorphism. Let [S x A] be the ranked 
alphabet such that for every fc > 

[i:xA]k = {{t,u)\t€i:{vuXk)ncl,{V),uGA{YuXk)nci{Y)} . 

There are canonical quasi-alphabetic homomorphisms : T[-exA\{V x y) — > Tj:{V) 
and rpxA](^ xY)^ Ta{Y) given by 

PvxY{{v,y)) =v pl{{t,u))=t 

PvxY{{v,y)) =y pl{{t,u)) = u 

for every {v,y) € V xY and {t,u) e [S x A]fe. Henceforth, we will use these 
projections also for other product ranked alphabets. 

Proposition 2 (see [6, Proposition 3.1]) There exists a quasi- alphabetic homo- 
morphism rj: Tr{Z) — > T[2xA](^ x ^) such that ttp = {tT])p^ and tip = {tr])p'^ for 
every t G Tt{Z). 

Proof. Let rj: Ty{Z) T'pxA](^ ^) be the tree homomorphism such that 
Vz{z) = {(pz{z),tpziz)) for every z € Z and %(/) = {(pk{f)Ak{f)) for every 
f €Tk. Clearly, rj is quasi-alphabetic, and it is easy to check that tip = {tri)p^ and 
# = {tr))p'^ for every teTr{Z). □ 
Using the previous proposition, we can now eliminate from B the ranked alpha- 
bet r, the index set Z, and the particular tree homomorphisms (p and tp. Essentially, 
every quasi-alphabetic relation r C Ts {V) x Ta {Y) is determined by a recognizable 
language L C TpxA](l^ x Y). 

Theorem 1 (see [6, Proposition 3.1]) A relation t C T^(V) x Ta{Y) is quasi- 
alphabetic if and only if there exists a recognizable language L C TpxA](l^ x Y) 
such that T = {{tp^, tp^ \ t &L}. 

Proof. The if-direction is trivial since (p^, L, p^) is a quasi-alphabetic bimorphism 
defining r. For the converse, let B = (p.L'.iJ.)) be a quasi-alphabetic bimorphism 
such that tb = t. By Proposition 2 there exists a quasi-alphabetic homomorphism 
T]: Tr{Z) T[^xA]{V x Y) such that r = {{trjp^ ,tr]p^) | t e L'}. Consequently, 
the language L = r]{L') has the desired properties because it is recognizable by [11, 
Theorem n.4.16]. □ 
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We immediately note that quasi-alphabetic relations are trivially closed under 
inverses [23, Theorem 4]; i.e., if r G S(qaH, Rec, qaH), then so is . As promised, 
let us use the previous theorem to prove that quasi-alphabetic relations are closed 
under union. 

Corollary 1 (cf. [6, Proposition 3.2]) i3(qaH, Rec, qaH) is closed under union. 
Proof. Let ti,T2 C Tj:{V) x Ta{Y) be quasi-alphabetic relations. By Theorem 1, 
there exist recognizable Li,L2 C Tpxn](^ such that 

n = {{tp^,tp^) \ tGLi} and T2 = {{tp^,tp^) \ tGL2} . 

Then 

T1UT2 = \ tGLi}u{{tp\tp^) \ tG L2} = {{tp\tp^) \ tGL1UL2} , 

which proves that ri U T2 is quasi-alphabetic by Theorem 1 (because Li U L2 is 
recognizable by [11, Theorem II. 4. 2]). □ 
Let us move on to closure under intersection. For closure under intersection, we 
would need to align the two input homomorphisms and the two output homomor- 
phisms at the same time and enforce equality both-sided. The next theorem shows 
that we are not able to do this and hence quasi-alphabetic relations are not closed 
under intersection. 

Theorem 2 Any class C of tree transformations such that 

IcssH C C C e(H, Rec, IH) 
is not closed under intersection. 

Proof. Let E = {f /2, g/1, e/0}. We consider the linear complete symbol-to- 
symbol homomorphisms ipi,tp2- Is Ts that are defined by 

if'iif) = fixi,X2) ipi{g) ^ g{xi) -01(6) =e 

ip2{f) = f{x2,xi) 'ip2{g) = gixi) -02(6) =e. 

Clearly, tjji and V-'2 belong to C. Let us consider the context C = g{xi) of C^. We 
observe that for every to, n > 

,f(,g'"(e),.9"W)V^i=./(C'"[e],C"[e]) 
/(5"(e),r(e))V2 = /(C"[e],C""[e]) . 

Let L = {/((7™(e), g"(e)) \ m,n > 0}. Clearly, L is a recognizable language. 
Assume that there exists t G S(H,Rec, 1H) such that r = 0i fl V'2- Since such 
bimorphisms preserve recognizable languages [11, Theorems II.4.2, II.4.16, II.4.18], 
the image t{L) should be recognizable. But, t{L) = {/(C"[e], C"[e]) | n > 0}, 
which is not recognizable. Hence no r with the given properties exists, which 
proves the statement. □ 
Corollary 2 (of Theorem 2) ,B(qaH, Rec, qaH) is not closed under intersection. 
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Finally, we note that ;B(qaH, Rec, qaH) is trivially not closed under complemen- 
tation by Proposition 1. Let us consider now common operations on trees. We 
immediately observe that intersection of a quasi- alphabetic relation with id^ where 
L is a recognizable language is again a quasi-alphabetic relation. Also the union 
with idi is a quasi- alphabetic relation because idi is a quasi- alphabetic relation 
for every recognizable language L and quasi-alphabetic relations are closed under 
union by Corollary 1. 

In general, the tree transformations sub and br (if we consider the branches as 
trees over an ranked alphabet of figures of rank and 1) are not quasi-alphabetic. 
Moreover, for L C T^{V) recognizable and v € V, also the following relations 
r and p, which are defined for every t G T^{V) by tr = t •v L and tp = t/yL, are 
not quasi-alphabetic, in general (cf. [6, Proposition 4.2 & p. 191-200]). All these 
can easily be proved using Proposition 1. Moreover, in general, quasi-alphabetic 
relations arc not closed under /-top-concatenation (cf. [6, Proposition 3.6]). 

Now, let us turn our attention to the translations computed by quasi-alphabetic 
bimorphisms. In [23] it was shown that they define the syntax-directed trans- 
lations [1]. Here we prove a more general result: the translations computed by 
quasi-alphabetic tree bimorphisms are exactly the products of context-free string 
languages (for definitions and details about context-free string languages the reader 
is referred to [11, Section 1.6]). 

Theorem 3 (cf. [6, Proposition 3.6]) For all context-free string languages Ki 
and K2 over the same alphabet V, there exists a quasi- alphabetic bimorphism B 
such that yd(rB) = Ki x K2. 

Proof. By [11, Corollary 2.4], there exist recognizable tree languages L\ C Te(I^) 

and L2 C Ta(F) such that {ydy(ti) | ti e LJ = fsTi and {ydy{t2) \ ^2 £ ^2} = K2. 
Let (p: Y ^ V be a bijection, and Y be disjoint with SUA. Then extend (j) to 
(j)^: EUF EUV' and 4>a - AuF AUF such that (ji^sls = ids and (ji^A^ = idA- 
We denote the ranked alphabets S U F and A U F, in which all symbols of Y are 
nullary, by S and A, respectively. Next, we define the ranked alphabet 

SvA = {(/,5)|/GS,5eA} 

such that rk((/, 5)) = max(rk(/), rk((7)). In a similar way the ranked alphabets 
S V A and E V A are defined. Without loss of generality, we can assume that 
EoT^yT^Aoand Ei ^ / Ai. 

Next we show how to embed a tree of T^iV) into T^yA- Roughly speaking, 
we read off the first components of the symbols of E V A while neglecting the 
additional subtrees. However, we need to make sure that the neglected subtrees 
contain no symbols of Y because the quasi-alphabetic homomorphism cannot ignore 
the additional subtrees, but should clearly not produce a piece of output string 
for them. To this end, we define the linear top-down tree transducer My. with 
regular look-ahead c such that My. = ({*},S V A, E U V,{-k],R), and for every 
(/, 5) G (E V A)fc we have the rule 

r = *{{f,g){xi, Xk)) (l>Y{f){-*{xi), . . . ,*(Xi.k(/))) 
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with look-ahead c(r) = {f,g){Ti,...,Tk) in R, where Ti = • • • = T^^f) = Tg^A 
and Trk(/)+i = • • ■ = 7fc = TsvA- In analogous way the top-down tree trans- 
ducer Ma with regular look-ahead is defined. Let L = r^^(_Li) n r^^(_L2), which 
is recognizable by [11, Corollary IV. 3. 17 and Theorem II. 4. 2]. Next, we take the 
quasi-alphabetic homomorphism (p: T^^^ ^eva(^)j which is defined for every 
(/,5)e(SvA)fc by 



Vk{{f,g)) = 



{f,9)ixi,---,Xm) if /eSrr» 

{h\,h2){(j){f)) otherwise 



where (/ii,/i2) G Si x Ai is arbitrary. Similarly, the quasi-alphabetic homomor- 
phism ip: TgyA ~* ^sva(^) is defined. Now if we take the quasi-alphabetic 
bimorphism B = {(p,L,ip), it should be clear that ydy{tip) — ydy(tTAfj,) and 
ydy(iV) = ydy (tTMi) for every t e Tg^A- Consequently, yd(TB) = Ki x K2, 
which concludes our proof. □ 

4. Relation to Other Classes 

In this section, we relate the class of quasi-alphabetic relations to other known 
classes of tree transformations. We focus on classes of transformations defined by 
bimorphisms [3, 8, 7] and classes of transformations computed by various top-down 
tree transducers [18, 26, 11]. Clearly, every strictly alphabetic (alphabetic in [11]) 
homomorphism is quasi-alphabetic and thus ;B(saH, Rec, saH) C ;B(qaH, Rcc, qaH). 
We start by showing that the class QREL of transformations computed by finite- 
state relabellings [9] is included in the class S(saH, Rec, saH). 
Proposition 3 QREL C B(saH, Rec, saH). 

Proof. Let r € QREL. Since QREL C In-TOP = REL o FTA o IcH [9, The- 
orem 3.5], there exists a relabeling M such that tm Q Is(l^) x Tr{Z), a rec- 
ognizable tree language L C Tr{Z), and a linear and complete homomorphism 
ip: Tr{Z) Ta{Y) such that t = {(tT^/,*^) I t £ LjMoreover, by the construc- 
tions of [9], ijj is symbol-to-symbol and Tj^^ : Tr{Z) Te(V) [i.e., r^^ is computed 
by a deterministic relabeling]. Consequently, Tj^^ and ip are strictly alphabetic 
because every deterministic relabeling is strictly alphabetic. Thus, the strictly al- 
phabetic bimorphism [r^l.L.ij)) defines r. □ 
The next proposition shows that every quasi-alphabetic relation can be com- 
puted by a linear top-down tree transducer with finite look-ahead [16]. With that 
we estabhsh rough lower and upper bounds to the power of quasi-alphabetic bimor- 
phisms. 

Proposition 4 ^(qaH, Rec, qaH) C 1-TOP^. 

Proof. Let us consider a quasi-alphabetic bimorphism B = {(p, L, Tp) where 
ip: Tr{Z) Tj:{V) and ip: Tt{Z) Ta(F). Without loss of generality, let tp be 

normalized. Moreover, let N = (Q, T U Z,r U Z, I, R) be an fta recognizing L. We 
construct the linear top-down tree transducer M with finite look-ahead c such that 
M = (Q, S U V, A U y, 7, R') and 
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• for every transition q ^ z G R with z € Z, we have the rule r = q{z(p) zip 
with look-ahead c(r) = {a;i} in R', and 

• for every transition q f{qi, ■ ■ ■ , qk) G R with / e Ffc and qi, . . . ,qk Q we 
have the rule 

= <liy^kif)is){xi,. ..,Xn))^ '<Pkif)[qiixj,), . . ■,qk{xj^)] 

with look-ahead c(r) = {fkif)} in R' , where ji = pos^. {(pk{f)) for every 
i G [k]. 

First, let us prove tb C tm by showing q{t(p) ttp for every q G Q and 

f G L{N)q. Let t G Z'. Then (/(tt^) =>m iV" using a rule constructed in the first item. 
Now let t = f{ti, . . . ,tk) for some f G and ti, . . . G Tr{Z). Moreover, let 
^i, • • • , 5fc G Q be such that U G L{N)q. for every i G [k] and q — > /(gi, . . . ,qk) G R. 
Then 

• • • , </c)</') = q{'^k{I)[h'^, tk'A) 

= Qigiuiihlfi, . . . , tkf], Un[tl(p, tkf])) 

where (pk{f) = g{ui,...,Un) for some g G S„ and Wi,...,u„ G TE(y). Let 
ji = l?os^.{(pk{f)) for every i G [k]. Then 

. . .,tk)(p) V'fe(/)[9i(%i [^i</5, ■ • ■,tk<f]), • ■ • , gfeKfc [iify^, • • ■ , tkf])] 

using a rule constructed in the second item. Note that the look-ahead restriction is 
trivially fulfilled. Clearly, Wj. — Xi for every i G [k] and thus we have 

q{f{ti,---,tkM =>M ipkiDiqiih^), ■ ■ ■ ,qk{tkip)] ■ 

By the induction hypothesis, we have qi{ti(p) titjj for every i G [k]. Conse- 
quently, we obtain 

q{t<fi) =>M Vfe(/)[9i(*iV),---,9fe(ife</5)] '4'k{f)[titlJ,---,tk'4'] =ttp . 

This proves the auxiliary statement and tb C tm if we consider states of /. 

The converse inclusion can be proved using the statement: For every q G Q, 
t G TY,{y), and u G Ta(F), if q{t) u, then there exists s G L{N)g such that 
t = Sif and u = sip. This can be proved by induction on the length of the derivation 
in M. We omit the details here. □ 

Next let us show that the class of alphabetic relations is essentially different 
from the classes of transformations computed by top-down tree transducers. For 
the specific class TOP this was already remarked in [6] and here we only refine this 
statement to the statements necessary for our purposes. 
Proposition 5 S(aH,Rec,aH) g l-TOP"^ and In-TOP % 6(aH, Rcc, aH). 
Proof. It is known that 1-TOP^ equals 1-BOT, which is the class of all tree trans- 
formations computable by linear bottom-up tree transducers [27, 9]. As claimed 
in [6, page 188], the class S(aH, Rec, aH) is incomparable to BOT, which is the 
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class of all tree transformations computed by bottom-up tree transducers. Con- 
sequently, B(aH,Rec,aH) % 1-TOP^. Moreover, it is known that IcH C In-TOP. 
Suppose that IcH C S(aH, Rcc, aH). Then also every hnear and complete inverse 
homomorphism can be implemented by an alphabetic bimorphism because alpha- 
betic relations are trivially closed under inverses. However, the proof of the main 
theorem in [3, Section 3.4] then shows that alphabetic relations are not closed under 
composition. This contradicts [6, Theorem 5.2], thus IcH % S(aH, Rec, aH). This 
yields In-TOP % ^(aH, Rec, aH). □ 
Next we consider the relation of quasi-alphabetic and alphabetic relations. We 
show that every quasi- alphabetic relation is also alphabetic (ala [6]). The strictness 
of this inclusion can be obtained using Proposition 5. 
Theorem 4 S(qaH, Rec, qaH) C S(aH, Rec, aH). 

Proof. Let us take a quasi-alphabetic tree bimorphism B = {tp, L, ip) where 
Lp: Ty{Z) T^{V) and ip: Tr{Z) T^(Y). Without loss of generality, let 
y 7^ 7^ y. Let S V A be the ranked alphabet introduced in [6, Section 2], 
and moreover let v ^ V and y ^ Y. We construct the linear tree homomor- 
phism p: Tr{Z) Tj:wa{V x Y) such that pz{z) = {z(p,zilj) for every z G Z 
and 

Pkif) = {t{£)w,u{e)yj'){xi, . . . ,Xk,ti, . . . ,ti) 
for every f ^Tk where 

• t = Pkif) and u = tjJkif), 

• {ii, ...,im}= POSy(t) and {ji, . . . , j„} = posyiu), 
I = max(m, n) and 



ta 



for every a G [I], and 



{t{ia), u{ja)) if a < min(m, n) 
{t{ia),y) iin<a<m 
{v,u{ja)) i{m<a<n 



• w = ■ ■ -Wk+m and w' = w'^, . . . are such that t{wa) = pk{f){a)'K\ for 

every a G [A: + m] and t{w\^) = pki.f)ib)Tr2 for every b G [k + n] where tti and TT2 
are the usual projections to the first and second components, respectively, with 
XTTi = X = X1T2 for every x G X. 

By [11, Theorem n.4.16], p{L) is recognizable. An easy proof shows that 

TB = {t(pi:,t(pA) I t G p{L)} 

where c^s and </?a are the canonical alphabetic homomorphisms of [6, Section 2]. 
Hence, rg is an alphabetic relation by [6, Proposition 3.1]. □ 

As an immediate consequence of Theorem 4, we get the fohowing result. 
Corollary 3 (see [6, Proposition 3.7]) Quasi- alphabetic relations preserve the 
recognizable tree languages and the algebraic tree languages. 
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B(aH, Rec, aH) 1-TOP^ 
1-TOP^ 




B(qaH, Rec, qaH) 1-TOP 

In-TOP 




S(saH, Rec, saH) 

QREL 
Figure 1: Hasse diagram. 

Finally, we need to show that linear top-down tree transducers are not sufficiently 
powerful to implement all quasi-alphabetic relations. 
Proposition 6 B(qaH, Rec, qaH) % 1-TOP. 

Proof. Let E = {//2, e/0} and V = {vx,V2\. Moreover, let : Te ^ be a 

quasi-alphabetic tree homomorphism with </3o(r) = f{vi. v-i). Then B — {(p, {e}, id) 
is a quasi-alphabetic tree bimorphism that defines {{f(vi,V2),e)}. It is known [9, 
Example 2.6] that tb is not in 1-TOP, and hence S(qaH, Rec, qaH) ^ 1-TOP. □ 
Let us collect our results in a Hasse diagram (see Figure 1). Note that in such 
a diagram every edge is oriented upwards and denotes strict inclusion. 
Theorem 5 Figure 1 is a Hasse diagram. 

Proof. The following six statements are sufficient to prove the claim. 

QREL C B(saH, Rec, saH) C B(qaH, Rec, qaH) C S(aH, Rec, aH) (1) 



S(saH, Rec, saH) C In-TOP C 1-TOP C l-TOP'" C 1-TOP^ (2) 

S(qaH, Rec, qaH) C 1-TOP^ (3) 

B(qaH, Rec, qaH) ^ 1-TOP (4) 

In-TOP 2 S(aH, Rec, aH) (5) 

S(aH, Rec, aH) ^ 1-TOP^ (6) 



Statement 1 is mostly clear using Proposition 3. The strictness is due to the fact 
that QREL is closed under intersection whereas this is not true for S(saH, Rec, saH) 
by Theorem 2. The final inclusion of (1) is proved in Theorem 4. The inclusions 
of (2) are all obvious and (3) is shown in Proposition 4. Finally, the inequality (4) 
is proved in Proposition 6 and inequalities (5) and (6) are proved in Proposition 5. 
□ 
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